A Hybrid Quasi-Harmonic/CELP Wideband Speech Coding Scheme for Unit Selection TTS Synthesis

نویسندگان

  • Chang-Heon Lee
  • Olivier Rosec
  • Yannis Stylianou
چکیده

This paper suggests a new wideband speech coding model to efficiently compress acoustic inventories for concatenative unit selection text-to-speech (TTS) synthesis system. To fulfill the requirements of TTS synthesizer such as partial segment decoding and random access capability, a non-predictive scheme was adopted which combines the adaptive Quasi-Harmonic Model (aQHM) with the innovative codebook (ICB) model. aQHM plays a major role in modeling pitch harmonic components, and ICB compensates, in a closed-loop way, for the modeling error of aQHM. This is especially important in transient or unvoiced regions. To further improve the coding efficiency, a hybrid coding framework is also suggested. Results from a large French speech database show that the proposed algorithm provides similar speech quality to the high quality AMR-WB codec while it supports the random access capability.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

High quality coding of wideband speech at 24 kbit/s

This paper proposes a Wideband-CELP-Coding scheme (bandwidth 7kHz) at 24 kbit/s. The codec introduces a delay of just 10 ms. This fulfills the requirements of a possible codec candidate for wideband speech coding within DECT or video applications [I]. The analysis-by-synthesis structure of the proposed Wideband-CELP-Codec includes an alternative LPC analysis concept, where the autocorrelation f...

متن کامل

A multi-band CELP wideband speech coder

A novel low-delay wideband speech coder, called Multiband CELP (MB-CELP), overcomes the major obstacles usually associated with two traditional CELP approaches to wideband speech coding namely fullband CELP and split-band CELP. The new MB-CELP coder employs a multi-band bank of o -line ltered excitation codebooks, fullband linear prediction synthesis, and minimization of the error between origi...

متن کامل

Efficient harmonic-CELP based hybrid coding of speech at low bit rates

This paper presents an efficient Harmonic-CELP hybrid coder at 2.4 kbps utilizing the well-known characteristics of the Harmonic and CELP coders. According to frame voicing decision, the proposed hybrid coder switches the RP-VSELP coder as a fast CELP in case of unvoiced, or an improved Harmonic coder in case of voiced. The proposed Harmonic-CELP hybrid coder has several features as follows: fa...

متن کامل

A hybrid sub-band sinusoidal coding scheme

This paper describes a hybrid sub-band speech coding scheme based on sinusoidal coding and CELP. Purely voiced speech is encoded using sinusoidal coding techniques and phase information is selectively transmitted. For mixed and unvoiced speech, the lower band is processed by sinusoidal coding algorithms while the upper band is encoded using CELP. To accommodate the extra bandwidth required by t...

متن کامل

High quality speech synthesis using a small speech dataset

We propose an approach to synthesizing high-quality speech under the conditions of a small dataset. A robust method for solving this problem is vital for voice restoration (recreation of lost fragments of records based on available speech material of a well-known person, e.g. an actor). The proposed TTS system is a hybrid system which includes the advantages of both HMMand Unit Selection-based ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011